Learning Communities in the Presence of Errors
نویسندگان
چکیده
The Stochastic Block Model or the Planted Partition Model is the most widely used probabilistic model for community detection and clustering graphs in various fields, including machine learning, statistics, and social sciences. Many existing algorithms successfully learn the communities or clusters when the data is drawn exactly according to the model, but they do not work well in the presence of errors. In this talk, I will address the following question: Can we design robust polynomial time algorithms for learning probabilistic models for community detection that work in the presence of adversarial modelling errors? I will present robust algorithms for (partially) recovering communities or clusters in graphs drawn from the Stochastic Block Model, in the presence of modeling errors or noise. These algorithms allow two types of adversarial errors: edge outlier errors (where an adversary can corrupt an arbitrary fraction of the edges), and any amount of Feige-Kilian or monotone errors. Mossel, Neeman and Sly (STOC 2015) posed an open question about whether an almost exact recovery is possible when the adversary is allowed to add o(n) edges. Our work answers this question affirmatively even in the case of k>2 communities. Finally, I will describe how our algorithms recover the clusters, even when the modelling error is captured using Kullback-Leibler (KL) divergence: these algorithms work when the instances come from any distribution of graphs that is m close to the Stochastic Block Model in the KL divergence (this result also handles adversarial errors). This is based on joint work with Konstantin Makarchev and Yury Makarychev. Organizer(s): Eric Allender, Pranjal Awasthi, Michael Saks and Mario Szegedy
منابع مشابه
On Presentation a new Estimator for Estimating of Population Mean in the Presence of Measurement error and non-Response
Introduction According to the classic sampling theory, errors that are mainly considered in the estimations are sampling errors. However, most non-sampling errors are more effective than sampling errors in properties of estimators. This has been confirmed by researchers over the past two decades, especially in relation to non-response errors that are one of the most fundamental non-immolation...
متن کامل“Professional Learning Communities (PLC): An Effective Strategy to Improve Teachers’ Self-Efficacy”
Nowadays almost all schools fail to develop a process in line with supporting the professional development of teachers. Professional learning communities intend to formulate a framework and process for teachers’ continuous learning and professional development. The present research aims to investigate the impact of Professional Learning Communities (PLC) on the teachers’ self-efficacy. This res...
متن کاملAn Analysis of Social Presence and Cognitive Presence in Discussion Forum
An increase of asynchronous online discussions in website provides much opportunity for L2 learners from different global communities to be exposed to the target language at their own pace and time. However, no research looking at the essentials of social presence and cognitive presence in creating a supportive learning environment in such a context has been done. This study investigated the pa...
متن کاملDetecting Overlapping Communities in Social Networks using Deep Learning
In network analysis, a community is typically considered of as a group of nodes with a great density of edges among themselves and a low density of edges relative to other network parts. Detecting a community structure is important in any network analysis task, especially for revealing patterns between specified nodes. There is a variety of approaches presented in the literature for overlapping...
متن کاملA Multiagent Reinforcement Learning algorithm to solve the Community Detection Problem
Community detection is a challenging optimization problem that consists of searching for communities that belong to a network under the assumption that the nodes of the same community share properties that enable the detection of new characteristics or functional relationships in the network. Although there are many algorithms developed for community detection, most of them are unsuitable when ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016